Somaliland
- Asia > Middle East > Israel (0.59)
- Africa > Middle East > Somaliland (0.51)
- Asia > Middle East > Yemen (0.15)
- (23 more...)
- Media > News (1.00)
- Government > Regional Government > North America Government > United States Government (0.98)
Evaluating the Capabilities of Large Language Models for Multi-label Emotion Understanding
Belay, Tadesse Destaw, Azime, Israel Abebe, Ayele, Abinew Ali, Sidorov, Grigori, Klakow, Dietrich, Slusallek, Philipp, Kolesnikova, Olga, Yimam, Seid Muhie
Large Language Models (LLMs) show promising learning and reasoning abilities. Compared to other NLP tasks, multilingual and multi-label emotion evaluation tasks are under-explored in LLMs. In this paper, we present EthioEmo, a multi-label emotion classification dataset for four Ethiopian languages, namely, Amharic (amh), Afan Oromo (orm), Somali (som), and Tigrinya (tir). We perform extensive experiments with an additional English multi-label emotion dataset from SemEval 2018 Task 1. Our evaluation includes encoder-only, encoder-decoder, and decoder-only language models. We compare zero and few-shot approaches of LLMs to fine-tuning smaller language models. The results show that accurate multi-label emotion classification is still insufficient even for high-resource languages such as English, and there is a large gap between the performance of high-resource and low-resource languages. The results also show varying performance levels depending on the language and model type. EthioEmo is available publicly to further improve the understanding of emotions in language models and how people convey emotions through various languages.
- North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
- Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.14)
- North America > Dominican Republic (0.14)
- (27 more...)
SynthesizRR: Generating Diverse Datasets with Retrieval Augmentation
Divekar, Abhishek, Durrett, Greg
It is often desirable to distill the capabilities of large language models (LLMs) into smaller student models due to compute and memory constraints. One way to do this for classification tasks is via dataset synthesis, which can be accomplished by generating examples of each label from the LLM. Prior approaches to synthesis use few-shot prompting, which relies on the LLM's parametric knowledge to generate usable examples. However, this leads to issues of repetition, bias towards popular entities, and stylistic differences from human text. In this work, we propose Synthesize by Retrieval and Refinement (SynthesizRR), which uses retrieval augmentation to introduce variety into the dataset synthesis process: as retrieved passages vary, the LLM is seeded with different content to generate its examples. We empirically study the synthesis of six datasets, covering topic classification, sentiment analysis, tone detection, and humor, requiring complex synthesis strategies. We find that SynthesizRR greatly improves lexical and semantic diversity, similarity to human-written text, and distillation performance, when compared to 32-shot prompting and four prior approaches. We release our extensive codebase at https://github.com/amazon-science/synthesizrr
- Asia > Russia (0.14)
- North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
- North America > Canada > Ontario > Toronto (0.04)
- (75 more...)
Addis summit raises questions about AU's muted stance on Ethiopia rifts
From Thursday, African leaders will gather in the Ethiopian capital, Addis Ababa, home of the African Union (AU), for the continental body's annual summit. According to AU Commission Chairperson Moussa Faki Mahamat, regional integration and "maintaining momentum in addressing issues of peace and security" is high on the agenda. But in an ironic twist, the host of the summit has either initiated or been involved in multiple conflicts in the last three years. Ethiopia's two-year civil war with the state of Tigray may have ended in November 2022 after a Pretoria pact, but federal troops are currently upping drone strikes against rebels known as Fano militia in the state of Amhara, next door to Tigray. This week, the Ethiopian Human Rights Council said "at least 45 civilians" had been killed by federal troops in Amhara.
- North America > United States (0.49)
- Africa > Ethiopia > Addis Ababa > Addis Ababa (0.29)
- Africa > South Africa > Gauteng > Pretoria (0.26)
- (11 more...)
Exploiting Cultural Biases via Homoglyphs in Text-to-Image Synthesis
Struppek, Lukas (a:1:{s:5:"en_US";s:33:"Technical University of Darmstadt";}) | Hintersdorf, Dom (Technical University of Darmstadt) | Friedrich, Felix (Technical University of Darmstadt) | br, Manuel (Technical University of Darmstadt) | Schramowski, Patrick (Technical University of Darmstadt) | Kersting, Kristian (Technical University of Darmstadt)
Models for text-to-image synthesis, such as DALL-E 2 and Stable Diffusion, have recently drawn a lot of interest from academia and the general public. These models are capable of producing high-quality images that depict a variety of concepts and styles when conditioned on textual descriptions. However, these models adopt cultural characteristics associated with specific Unicode scripts from their vast amount of training data, which may not be immediately apparent. We show that by simply inserting single non-Latin characters in the textual description, common models reflect cultural biases in their generated images. We analyze this behavior both qualitatively and quantitatively and identify a model's text encoder as the root cause of the phenomenon. Such behavior can be interpreted as a model feature, offering users a simple way to customize the image generation and reflect their own cultural background. Yet, malicious users or service providers may also try to intentionally bias the image generation. One goal might be to create racist stereotypes by replacing Latin characters with similarly-looking characters from non-Latin scripts, so-called homoglyphs. To mitigate such unnoticed script attacks, we propose a novel homoglyph unlearning method to fine-tune a text encoder, making it robust against homoglyph manipulations.
- Europe > Greece (0.14)
- North America > United States (0.14)
- Europe > Germany > Hesse > Darmstadt Region > Darmstadt (0.05)
- (37 more...)
- Research Report > New Finding (1.00)
- Overview (1.00)
- Information Technology > Security & Privacy (1.00)
- Law (0.66)
RELand: Risk Estimation of Landmines via Interpretable Invariant Risk Minimization
Rubio, Mateo Dulce, Zeng, Siqi, Wang, Qi, Alvarado, Didier, Moreno, Francisco, Heidari, Hoda, Fang, Fei
Landmines remain a threat to war-affected communities for years after conflicts have ended, partly due to the laborious nature of demining tasks. Humanitarian demining operations begin by collecting relevant information from the sites to be cleared, which is then analyzed by human experts to determine the potential risk of remaining landmines. In this paper, we propose RELand system to support these tasks, which consists of three major components. We (1) provide general feature engineering and label assigning guidelines to enhance datasets for landmine risk modeling, which are widely applicable to global demining routines, (2) formulate landmine presence as a classification problem and design a novel interpretable model based on sparse feature masking and invariant risk minimization, and run extensive evaluation under proper protocols that resemble real-world demining operations to show a significant improvement over the state-of-the-art, and (3) build an interactive web interface to suggest priority areas for demining organizations. We are currently collaborating with a humanitarian demining NGO in Colombia that is using our system as part of their field operations in two areas recently prioritized for demining.
- South America > Colombia > Bolivar Department (0.04)
- North America > United States > New York > New York County > New York City (0.04)
- Europe > United Kingdom > North Sea > Central North Sea (0.04)
- (13 more...)
Context-faithful Prompting for Large Language Models
Zhou, Wenxuan, Zhang, Sheng, Poon, Hoifung, Chen, Muhao
Large language models (LLMs) encode parametric knowledge about world facts and have shown remarkable performance in knowledge-driven NLP tasks. However, their reliance on parametric knowledge may cause them to overlook contextual cues, leading to incorrect predictions in context-sensitive NLP tasks (e.g., knowledge acquisition tasks). In this paper, we seek to assess and enhance LLMs' contextual faithfulness in two aspects: knowledge conflict and prediction with abstention. We demonstrate that LLMs' faithfulness can be significantly improved using carefully designed prompting strategies. In particular, we identify opinion-based prompts and counterfactual demonstrations as the most effective methods. Opinion-based prompts reframe the context as a narrator's statement and inquire about the narrator's opinions, while counterfactual demonstrations use instances containing false facts to improve faithfulness in knowledge conflict situations. Neither technique requires additional training. We conduct experiments on three datasets of two standard NLP tasks, machine reading comprehension and relation extraction, and the results demonstrate significant improvement in faithfulness to contexts. Code and data are released at https://github.com/wzhouad/context-faithful-llm.
- North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
- North America > United States > Washington > King County > Seattle (0.14)
- North America > United States > Texas > Harris County > Houston (0.14)
- (17 more...)
- Education (0.48)
- Leisure & Entertainment (0.46)
- Government (0.46)
Exploiting Cultural Biases via Homoglyphs in Text-to-Image Synthesis
Struppek, Lukas, Hintersdorf, Dominik, Friedrich, Felix, Brack, Manuel, Schramowski, Patrick, Kersting, Kristian
Models for text-to-image synthesis, such as DALL-E~2 and Stable Diffusion, have recently drawn a lot of interest from academia and the general public. These models are capable of producing high-quality images that depict a variety of concepts and styles when conditioned on textual descriptions. However, these models adopt cultural characteristics associated with specific Unicode scripts from their vast amount of training data, which may not be immediately apparent. We show that by simply inserting single non-Latin characters in a textual description, common models reflect cultural stereotypes and biases in their generated images. We analyze this behavior both qualitatively and quantitatively, and identify a model's text encoder as the root cause of the phenomenon. Additionally, malicious users or service providers may try to intentionally bias the image generation to create racist stereotypes by replacing Latin characters with similarly-looking characters from non-Latin scripts, so-called homoglyphs. To mitigate such unnoticed script attacks, we propose a novel homoglyph unlearning method to fine-tune a text encoder, making it robust against homoglyph manipulations.
- Europe > Greece (0.14)
- Europe > Germany > Hesse > Darmstadt Region > Darmstadt (0.05)
- North America > United States > New York (0.04)
- (35 more...)
- Overview (1.00)
- Research Report > New Finding (0.46)
- Information Technology > Security & Privacy (1.00)
- Law (0.87)
A Mixed-Method Approach to Determining Contact Matrices in the Cox's Bazar Refugee Settlement
Walker, Joseph, Aylett-Bullock, Joseph, Shi, Difu, Maina, Allen Gidraf Kahindo, Evers, Egmond Samir, Harlass, Sandra, Krauss, Frank
Contact matrices are an important ingredient in age-structured epidemic models to inform the simulated spread of the disease between sub-groups of the population. These matrices are generally derived using resource-intensive diary-based surveys and few exist in the Global South or tailored to vulnerable populations. In particular, no contact matrices exist for refugee settlements - locations under-served by epidemic models in general. In this paper we present a novel, mixed-method approach, for deriving contact matrices in populations which combines a lightweight, rapidly deployable, survey with an agent-based model of the population informed by census and behavioural data. We use this method to derive the first set of contact matrices for the Cox's Bazar refugee settlement in Bangladesh. The matrices from the refugee settlement show strong banding effects due to different age cut-offs in attendance at certain venues, such as distribution centres and religious sites, as well as the important contribution of the demographic profile of the settlement which was encoded in the model. These can have significant implications to the modelled disease dynamics. To validate our approach, we also apply our method to the population of the UK and compare our derived matrices against well-known contact matrices previously collected using traditional approaches. Overall, our findings demonstrate that our mixed-method approach can address some of the challenges of both the traditional and previously proposed agent-based approaches to deriving contact matrices, and has the potential to be rolled-out in other resource-constrained environments. This work therefore contributes to a broader aim of developing new methods and mechanisms of data collection for modelling disease spread in refugee and IDP settlements and better serving these vulnerable communities.
- Asia > Bangladesh (0.24)
- Europe > Sweden (0.14)
- Oceania > Australia > Australian Capital Territory > Canberra (0.04)
- (9 more...)
- Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
- Health & Medicine > Therapeutic Area > Immunology (1.00)
- Government (1.00)
- (2 more...)
The Internet of the Orals
Internet services like social media, online discussion forums, and crowdsourcing marketplaces have transformed how people participate in the information ecology and digital economy. These services empower mostly urban, affluent, and literate people, and improve their reach to information and instrumental needs. However, these services currently exclude billions of people worldwide who are too poor to afford Internet-enabled devices, too remote to access the Internet, or too low literate to navigate the mostly text-driven Internet. In India and Pakistan alone, there are nearly 1.1 billion people offline. Although 70% of their populations have access to mobile phones, most people still use basic or feature phones, making it difficult to extend existing Internet services on these devices running custom operating systems.
- North America > United States > Georgia > Fulton County > Atlanta (0.05)
- North America > Canada > Quebec > Montreal (0.05)
- Europe > United Kingdom > Scotland > City of Glasgow > Glasgow (0.05)
- (11 more...)
- Health & Medicine > Public Health (0.95)
- Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.30)
- Health & Medicine > Therapeutic Area > Immunology (0.30)